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DETAILED ACTION 
Claim Rejections - 35 USC § 103 

1 . The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the Invention was made. 

2. Claims 1-18 and 20-21 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Panusopone (6,483,874) in view of Carlbom (7,203,693). 

Regarding claim 1 , Panusopone discloses a system for encoding a plurality of 
videos acquired of a moving object in a scene by a plurality of fixed cameras, 
comprising: 

means for determining a segmentation mask for each frame of each video, the 
segmentation mask identifying only pixels in the frame associated with the object 
(col.5, ln.29-42, segmentation masks are used for video frames that identify pixels 
associated with the video object); 

a shape encoder configured to encode the segmentation masks (fig.2, element 
210, col.5. ln.47-48); 

a position encoder configured to encode a position of each pixel (fig.2, element 
220, col.5, ln.48-50, motion data is coded to code a position of each pixel in a frame); 
and 

a color encoder configured to encode a color of each pixel (fig.2, element 240, 
col.5, ln.50-53). 
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Panusopone does not specifically disclose means for determining camera 
calibration data of each camera and means for associating the camera calibration data 
of each camera with the video acquired by the camera. However, Carlbom teaches 
means for determining camera calibration data of each camera {fig.2B, Carlbom 
discloses element 252 that determines the camera calibration data, see col.8, ln.56-61 ) 
and means for associating the camera calibration data of each camera with the video 
acquired by the camera (col.8. ln.56-61 ; Carlbom discloses that each camera has a 
unique identifier, ie. a camera ID, and its calibration parameters associated with 3D 
position, dimensions, orientation, zoom, focus and viewing volume). Therefore, it 
would have been obvious to one of ordinary skill in the art to combine the teachings of 
Panusopone and Carlbom, as a whole, for producing high quality images of an 
observed real world event (Carlbom col.2, ln.28-31). 

Regarding claim 2, Panusopone discloses the multiplexer for combining outputs 
of the shape encoder, the position encoder, and the color encoder into a single 
bitstream (fig.2, element 280). 

Regarding claims 3-5 and 12-14, Panusopone discloses further comprising: a 
decoder (fig.3, element 300); means for transferring the bitstream to the decoder (fig.2, 
element 280 multiplexes the bitstream and prepares the transportation of the bitstream 
to send to decoder 300 on fig.3). Panusopone does not specifically disclose rendering 
a decoded bitstream from an arbitrary viewpoint using the camera calibration data. 
However, Carlbom teaches the use of camera calibration data (col.8, ln.56-61 ; 
Carlbom discloses that each camera has a unique identifier, ie. a camera ID, and its 
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calibration parameters associated with 3D position, dimensions, orientation, zoom, 
focus and viewing volume). Therefore, it would have been obvious to one of ordinary 
skill in the art to combine the teachings of Panusopone and Carlbom, as a whole, for 
producing high quality images of an observed real world event (Carlbom col.2, ln,28- 
31). 

Regarding claims 6 and 1 1 , Panusopone does not disclose further comprising: 
means for maintaining a dynamic 3D point model defining a geometry of the moving 
object. However, Carlbom teaches the use of a dynamic 3D model (col.9, ln.22-43). 
Therefore, it would have been obvious to one of ordinary skill in the art to combine the 
teachings of Panusopone and Carlbom, as a whole, for producing high quality images 
of an observed real world event (Carlbom col.2, ln.28-31). 

Regarding claim 7, Panusopone does not specifically disclose in which each 
point of the dynamic 3D point model is associated with an identifier of one or more of 
the plurality of cameras. However, Carlbom teaches in which each point of the 
dynamic 3D point model is associated with an identifier of one or more of the plurality 
of cameras (col,8, in. 56-61 ; Carlbom discloses that each camera has a unique 
identifier, ie. a camera ID, and its calibration parameters associated with 3D position, 
dimensions, orientation, zoom, focus and viewing volume). Therefore, it would have 
been obvious to one of ordinary skill in the art to combine the teachings of Panusopone 
and Carlbom, as a whole, for producing high quality images of an observed real world 
event (Carlbom col.2, ln.28-31). 
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Regarding claims 8. 10, 16 and 17, Panusopone discloses in which the 
segmentation masks are encoded using l\/IPEG-4 lossless binary shape encoding, the 
positions include depth values encoded as quantized pixel luminance values, and the 
colors are encoded using MPEG-4 video object coding (col.4, ln.58 to col.5. In. 12). 

Regarding claim 9, Panusopone does not specifically disclose in which the 
camera calibration data are updated periodically when any of the fixed cameras are 
relocated. However, Carlbom teaches the camera calibration data are updated 
periodically when any of the fixed cameras are relocated (fig.4, element 412). 
Therefore, it would have been obvious to one of ordinary skill in the art to combine the 
teachings of Panusopone and Carlbom, as a whole, for producing high quality images 
of an observed real world event (Carlbom col.2, ln.28-31 ). 

Regarding claim 15, Panusopone does not specifically disclose in which a 
playback frame rate of the output video is different than a frame rate used to acquired 
the videos by the plurality of cameras. However, Carlbom teaches the playback frame 
rate of the output video is different than a frame rate used to acquire the videos by the 
plurality of cameras (fig.1 , element 120 is a visualization interface can be used to 
playback the video at a different frame rate than the acquired image rate, ie. fast 
fonward or slow-motion). Therefore, it would have been obvious to one of ordinary skill 
in the art to combine the teachings of Panusopone and Carlbom, as a whole, for 
producing high quality images of an observed real world event (Carlbom col.2, ln.28- 
31) 
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Regarding claims 18 and 20-21 , Panusopone discloses further comprising: 
means for partitioning each video into a plurality of segments, each segment including 
a plurality of frames (col.6, ln.36-43, interframe coding involves a group of frames); and 
means for encoding a key frame and difference frames of each segment, using the 
shape encoder, the position encoder, and the color encoder into a single bitstream 
(col.6, ln.61 to col.7, ln.6 and col.7, ln.24-26, Panusopone disclose the encoding of the 
anchor frame or the key frame and the fonward and backward MVs are used for 
obtaining the difference frames, and in fig.2, note the use of a multiplexer 280 to merge 
the data into a single bitstream from the the shape encoder 210, position encoder 220 
and color encoder 240). 

Claims 19 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Panusopone (6.483,874) and Carlbom (7,203,693) in further view of Wu (6,614,936). 

Panusopone and Carlbom do not disclose in which the key frames comprise a 
base layer of an encoded video bitstream, and the difference frames comprise an 
enhancement layer of the encoded bitstream. However, Wu teaches the base layer 
(fig.9, element 82) and the enhancement layer (fig.9, element 84). Therefore, it would 
have been obvious to one of ordinary skill in the art to combine the teachings of 
Panusopone, Carlbom and Wu, as a whole, for accurately, efficiently encoding images 
in a high quality manner. 

Claims 22 and 24 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Panusopone (6.483,874) and Carlbom (7.203,693) in further view of Rusinkiewicz 
("Qsplat: A multiresolution point rendering system for large meshes" In SIGGRAPH 
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2000 Conference Proceedings, ACIVI Siggraph Annual Conference Series, pages 343- 
352, 2000). 

Panusopone discloses the encoder used to encode images (fig.2). Carlbom 
discloses the surface normal encoder (col.9, ln.44-55; VRML and MPEG-4 were 
implemented), Panusopone and Carlbom do not specifically disclose the splat size 
encoder. However, Rusinkiewicz teaches the use of splat size encoder (page 344, 
section 2,1 Rendering Algorithms, the Qsplat uses a recursive frame rate encoding 
scheme for splat size encoding). Therefore, it would have been obvious to one of 
ordinary skill in the art to combine the teachings of Panusopone, Carlbom and 
Rusinkiewicz, as a whole, for efficiently encoding/decoding image data so as to 
accurately produce high quality images. 

Allowable Subject Matter 
3. Claim 23 is objected to as being dependent upon a rejected base claim, but 
would be allowable if rewritten in independent form including all of the limitations of the 
base claim and any intervening claims. 

The following is a statement of reasons for the indication of allowable subject 
matter: The prior art does not specifically disclose "the surface normal vectors are 
progressively encoded using an octahedron subdivision of a unit sphere and the splat 
sizes are encoded as quantized codewords represented in a gray scale MPEG video 
object". 
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